An Improved Search Algorithm for Optimal Multiple-Sequence Alignment

机译：一种改进的最优多序列对齐搜索算法

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multiple sequence alignment (MSA) is a ubiquitous problem in computationalbiology. Although it is NP-hard to find an optimal solution for an arbitrarynumber of sequences, due to the importance of this problem researchers aretrying to push the limits of exact algorithms further. Since MSA can be cast asa classical path finding problem, it is attracting a growing number of AIresearchers interested in heuristic search algorithms as a challenge withactual practical relevance. In this paper, we first review two previous,complementary lines of research. Based on Hirschbergs algorithm, DynamicProgramming needs O(kN^(k-1)) space to store both the search frontier and thenodes needed to reconstruct the solution path, for k sequences of length N.Best first search, on the other hand, has the advantage of bounding the searchspace that has to be explored using a heuristic. However, it is necessary tomaintain all explored nodes up to the final solution in order to prevent thesearch from re-expanding them at higher cost. Earlier approaches to reduce theClosed list are either incompatible with pruning methods for the Open list, ormust retain at least the boundary of the Closed list. In this article, wepresent an algorithm that attempts at combining the respective advantages; likeA* it uses a heuristic for pruning the search space, but reduces both themaximum Open and Closed size to O(kN^(k-1)), as in Dynamic Programming. Theunderlying idea is to conduct a series of searches with successively increasingupper bounds, but using the DP ordering as the key for the Open priority queue.With a suitable choice of thresholds, in practice, a running time below fourtimes that of A* can be expected. In our experiments we show that our algorithmoutperforms one of the currently most successful algorithms for optimalmultiple sequence alignments, Partial Expansion A*, both in time and memory.Moreover, we apply a refined heuristic based on optimal alignments not only ofpairs of sequences, but of larger subsets. This idea is not new; however, tomake it practically relevant we show that it is equally important to bound theheuristic computation appropriately, or the overhead can obliterate anypossible gain. Furthermore, we discuss a number of improvements in time andspace efficiency with regard to practical implementations. Our algorithm, usedin conjunction with higher-dimensional heuristics, is able to calculate for thefirst time the optimal alignment for almost all of the problems in Reference 1of the benchmark database BAliBASE.

机译：多序列比对（MSA）是计算生物学中普遍存在的问题。尽管要为任意数量的序列找到最优解都是NP难的，但是由于这个问题的重要性，研究人员正试图进一步推动精确算法的极限。由于MSA可以看作是经典的路径查找问题，因此它吸引了越来越多的对启发式搜索算法感兴趣的AI研究人员，这是一种具有实际实际意义的挑战。在本文中，我们首先回顾了之前的两个互补研究领域。基于Hirschbergs算法，DynamicProgramming需要O（kN ^（k-1））空间来存储搜索边界和重建解路径所需的节点（对于k个长度为N的序列）。限制必须使用启发式方法探索的搜索空间的优势。但是，有必要将所有探索的节点保持到最终解决方案，以防止搜索以更高的成本重新扩展它们。减少“已关闭”列表的较早方法要么与“打开”列表的修剪方法不兼容，要么必须至少保留“已关闭”列表的边界。在本文中，我们提出了一种试图结合各自优点的算法。 likeA *它使用启发式方法修剪搜索空间，但是将最大打开和关闭大小都减小到O（kN ^（k-1）），就像在动态编程中一样。其基本思想是进行一系列搜索，并不断增加上限，但使用DP排序作为开放优先级队列的关键。实际上，通过选择适当的阈值，可以预期运行时间低于A *的四倍。。在实验中，我们证明了我们的算法在时间和内存方面均优于目前最成功的最优多序列比对算法之一，部分扩展A *。此外，我们不仅基于序列对，而且基于序列更大的子集。这个想法并不新鲜；但是，为了使其实用相关，我们证明适当地限制启发式计算同样重要，否则开销会消除任何可能的收益。此外，我们讨论了有关实际实现的时间和空间效率方面的许多改进。我们的算法与高维启发式算法结合使用，能够首次针对基准数据库BAliBASE的参考1中的几乎所有问题计算最佳对齐方式。

著录项

作者
Schroedl, S.;
展开▼
作者单位

展开▼
年度 2011
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. An Improved Search Algorithm for Optimal Multiple-Sequence Alignment [J] . Schroedl S. The Journal of Artificial Intelligence Research . 2005,第12期

机译：最优多序列比对的改进搜索算法
2. An Improved Search Algorithm for Optimal Multiple-Sequence Alignment [J] . Stefan Schroedl The Journal of Artificial Intelligence Research . 2005,第0期

机译：最优多序列比对的改进搜索算法
3. Optimal clustering of MGs based on droop controller for improving reliability using a hybrid of harmony search and genetic algorithms [J] . Abedini Mohammad, Moradi Mohammad H., Hosseinian S. M. ISA Transactions . 2016,第Null期

机译：基于下垂控制器的MG最优聚类，通过和谐搜索和遗传算法的混合来提高可靠性
4. An Optimal Alignment Search System for Highway Design using Genetic Algorithms and Three-Dimensional CAD [C] . M. Yamasaki, T. Mori, A. Kobayashi, International Conference on Civil, Structural and Environmental Engineering Computing . 2007

机译：使用遗传算法和三维CAD的公路设计优化对准搜索系统
5. Proactive search: Using outcome-based dynamic nearest-neighbor recommendation algorithms to improve search engine efficacy. [D] . Wagner, Christopher Shaun. 2014

机译：主动搜索：使用基于结果的动态最近邻居推荐算法来提高搜索引擎的效率。
6. An Experience Oriented-Convergence Improved Gravitational Search Algorithm for Minimum Variance Distortionless Response Beamforming Optimum [O] . Soodabeh Darzi, Sieh Kiong Tiong, Mohammad Tariqul Islam, -1

机译：一种面向经验的收敛性引力搜索算法用于最小方差无失真响应波束成形优化
7. Optimal distribution network configuration using improved backtracking search algorithm [O] . Thuan Thanh Nguyen 2020

机译：使用改进的回溯搜索算法的最佳分配网络配置

An Improved Search Algorithm for Optimal Multiple-Sequence Alignment

摘要

著录项

相似文献

相关主题

期刊订阅